Eecient and Eeective Clustering Methods for Spatial Data Mining

نویسندگان

  • Raymond T. Ng
  • Jiawei Hany
چکیده

Spatial data mining is the discovery of interesting relationships and characteristics that may exist implicitly in spatial databases. In this paper, we explore whether clustering methods have a role to play in spatial data mining. To this end, we develop a new clustering method called CLARANS which is based on randomized search. We also develop two spatial data mining algorithms that use CLARANS. Our analysis and experiments show that with the assistance of CLARANS, these two algorithms are very e ective and can lead to discoveries that are di cult to nd with current spatial data mining algorithms. Furthermore, experiments conducted to compare the performance of CLARANS with that of existing clustering methods show that CLARANS is the most e cient. keywords: spatial data mining, clustering algorithms, randomized search

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mining Spatial Data via Clustering

Contributions from researchers in Knowledge Discovery are producing essential tools in order to better understand the typically large amounts of spatial data in Geographical Information Systems. Clustering techniques are proving to be valuable in providing exploratory analysis functionality while supporting approaches for automated pattern discovery in spatially referenced data and for the iden...

متن کامل

A Clustering Approach for Large Visual Databases

The representation and organization of images in a large-scale image database is crucial to support eeective and eecient accesses to image data on the basis of content. In this process, signiicant features must rst be extracted from image data in their pixel format. These features must then be classiied and indexed to assist eecient retrieval of image content. In this paper, we investigate eeec...

متن کامل

Hybrid Genetic Algorithms areBetter for Spatial

Iterative methods and genetic algorithms have been used separately to minimise the loss function of many representative-based clustering formulations. Neither of them alone seems to be signiicantly better. Moreover, the trade-oo of eeort vs quality slightly favours gradient descent. We present a unifying view for the three most popular loss functions: least sum of squares, its fuzzy version and...

متن کامل

Using Combined Descriptive and Predictive Methods of Data Mining for Coronary Artery Disease Prediction: a Case Study Approach

Heart disease is one of the major causes of morbidity in the world. Currently, large proportions of healthcare data are not processed properly, thus, failing to be effectively used for decision making purposes. The risk of heart disease may be predicted via investigation of heart disease risk factors coupled with data mining knowledge. This paper presents a model developed using combined descri...

متن کامل

Applying a decision support system for accident analysis by using data mining approach: A case study on one of the Iranian manufactures

Uncertain and stochastic states have been always taken into consideration in the fields of risk management and accident, like other fields of industrial engineering, and have made decision making difficult and complicated for managers in corrective action selection and control measure approach. In this research, huge data sets of the accidents of a manufacturing and industrial unit have been st...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994